Supporting temporal question answering: strategies for offline data collection
نویسندگان
چکیده
We pursue two strategies for offline data collection for a temporal question answering system that uses both quantitative methods and fuzzy methods to reason about time and events. The first strategy extracts event descriptions from the structured year entries in the online encyclopedia Wikipedia, yielding clean quantitative temporal information about a range of events. The second strategy mines the web using patterns indicating temporal relations between events and times and between events. Web mining leverages the volume of data available on the web to find qualitative temporal relations between known events and new, related events and to build fuzzy time spans for events for which we lack crisp metric temporal information.
منابع مشابه
Developing Offline Strategies for Answering Medical Questions
We describe ongoing developments on two offline strategies for automatically answering questions in the medical domain: one based on an analysis of the document structure, the other based on dependency parsing. We highlight differences with open domain question answering, and provide a preliminary evaluation of the current state of our strategies.
متن کاملBoosting Passage Retrieval through Reuse in Question Answering
Question Answering (QA) is an emerging important field in Information Retrieval. In a QA system the archive of previous questions asked from the system makes a collection full of useful factual nuggets. This paper makes an initial attempt to investigate the reuse of facts contained in the archive of previous questions to help and gain performance in answering future related factoid questions. I...
متن کاملPreprocessing Documents to Answer Dutch Questions
We describe a framework for offline extraction of certain types of information from a document collection, and discuss its usage for answering factoid questions. We implemented this approach as a part of the Dutch Question Answering System developed at the University of Amsterdam. The evaluation of the system using data from the CLEF 2003 Question Answering track shows that our strategy yields ...
متن کاملOffline Strategies for Online Question Answering: Answering Questions Before They Are Asked
Recent work in Question Answering has focused on web-based systems that extract answers using simple lexicosyntactic patterns. We present an alternative strategy in which patterns are used to extract highly precise relational information offline, creating a data repository that is used to efficiently answer questions. We evaluate our strategy on a challenging subset of questions, i.e. “Who is ....
متن کاملVidiam: Corpus-based Development of a Dialogue Manager for Multimodal Question Answering
In this chapter we describe the Vidiam project, which concerns the development of a dialogue management system for multi-modal question answering dialogues as it was carried out in the IMIX project. The approach that was followed is data-driven, that is, corpus-based. Since research in Question Answering Dialog for multi-modal information retrieval is still new, no suitable corpora were availab...
متن کامل